A Framework for Re-Optimization of Repetitive Queries

Authors

  • Feng Yu
  • Wen-Chi Hou
  • Cheng Luo
  • Qiang Zhu
Abstract

Optimizing the execution of queries is the ultimate goal of the query optimizer. Unfortunately, due to the complexity of queries, inaccurate statistics, invalid assumptions, etc., query optimizers often cannot find the best execution plans in their search spaces, conveniently called the optimal plans, for the queries. In this paper, we develop a comprehensive framework for re-optimization of a large and useful set of queries, called repetitive queries. Repetitive queries are queries that are likely to be used repeatedly or frequently in the future. They are usually stored in the database for convenient long-term reuse, and they deserve more optimization effort than ordinary ad hoc queries. In this research, we identify statistics, called sufficient statistics, that are sufficient to compute the exact sizes of the intermediate results of all plans of a query. Identifying the sufficient statistics makes it possible for an optimizer to find the truly best plan in its search space for a query. We present two innovative re-optimization techniques: eager and lazy re-optimization. The eager approach gathers all the sufficient statistics at once and generates the best plan. The lazy approach gathers only the statistics that are needed to correct large estimation errors found in the plan and generates a revised plan. We further adapt the two basic techniques to constantly changing database environments by continuously monitoring and revising the plans, which we call adaptive re-optimization. Adaptive re-optimization is devised to detect and remedy potential suboptimality in the plans in a timely manner for the entire lifetime of the query. We have also designed an approximate re-optimization method that provides an efficient and effective alternative for refining query execution plans. Our work realizes the promise made by query optimizers, namely, executing queries in an optimal fashion, at least for repetitive queries.
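
The selection step of the lazy approach can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the `PlanNode` class, the ratio-based error measure, the threshold of 10, and the sample plan are hypothetical, and the abstract does not say how a "large" estimation error is defined or how the statistics are collected.

```python
from dataclasses import dataclass


@dataclass
class PlanNode:
    """Hypothetical record for one intermediate result of an executed plan."""
    name: str              # label of the intermediate result
    estimated_rows: float  # cardinality the optimizer estimated
    actual_rows: float     # cardinality observed during execution


def error_ratio(node: PlanNode) -> float:
    """Symmetric ratio between estimate and observation; always >= 1."""
    est = max(node.estimated_rows, 1.0)  # clamp to avoid division by zero
    act = max(node.actual_rows, 1.0)
    return max(est / act, act / est)


def stats_to_gather_lazy(plan: list[PlanNode], threshold: float = 10.0) -> list[str]:
    """Return only the intermediate results whose estimates were badly off.

    The threshold is an assumed tuning knob, not a value from the paper.
    """
    return [node.name for node in plan if error_ratio(node) > threshold]


# Hypothetical plan with one badly misestimated join.
plan = [
    PlanNode("orders JOIN customers", 1000, 1200),
    PlanNode("(orders JOIN customers) JOIN items", 5000, 400000),
    PlanNode("filter(items.price > 100)", 300, 290),
]

to_fix = stats_to_gather_lazy(plan)
print(to_fix)
```

In this sketch only the second join exceeds the threshold, so a lazy re-optimizer would collect statistics for that result alone and re-plan, whereas the eager approach described in the abstract would collect all sufficient statistics up front.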

Related resources

A Framework for Re-optimizing Repetitive Queries

In this paper, we develop a comprehensive framework for re-optimization of a large and useful set of queries, called repetitive queries. Repetitive queries refer to those queries that are likely to be used repeatedly or frequently in the future. They deserve more optimization efforts than ordinary ad hoc queries. In this research, we identify statistics, called sufficient statistics, that are suffici...

Sample Trace: Deriving Fast Approximation for Repetitive Queries

Repetitive queries refer to those queries that are likely to be executed repeatedly in the future. Queries such as those used to generate periodic reports, perform routine summarization and data analysis belong to this category. Repetitive queries can constitute a large portion of the daily activities of a database system, and thus deserve extra optimization efforts. In this paper, we propose t...

Relational Databases Query Optimization using Hybrid Evolutionary Algorithm

Optimizing database queries is a hard research problem. Exhaustive search techniques such as dynamic programming are suitable for queries with a few relations, but as the number of relations in a query increases, they require too much memory and processing to remain practical, so random and evolutionary methods must be used instead. The use of evolutionary methods, beca...

A discrete-event optimization framework for mixed-speed train timetabling problem

Railway scheduling is a complex task for rail operators that involves generating a conflict-free train timetable. This paper presents a discrete-event simulation-based optimization approach for solving the train timetabling problem to minimize total weighted unplanned stop time in a hybrid single- and double-track railway network. The designed simulation model is used as a platform for ge...

Scheduling Resource Usage in Object-Oriented Queries

Query facilities in object-oriented databases lag behind their relational counterparts in performance. This paper identifies important sources of that performance difference, the random I/O problem and the re-reading problem. We propose three techniques for improving the execution of object-oriented database queries: reuse/out of order execution, memoization, and buffer replacement policy. Sche...

Publication date: 2010